Towards Learning to Ignore Irrelevant State Variables

نویسندگان

  • Nicholas K. Jong
  • Peter Stone
چکیده

Hierarchical methods have attracted much recent attention as a means for scaling reinforcement learning algorithms to increasingly complex, real-world tasks. These methods provide two important kinds of abstraction that facilitate learning. First, hierarchies organize actions into temporally abstract high-level tasks. Second, they facilitate task dependent state abstractions that allow each high-level task to restrict attention only to relevant state variables. In most approaches to date, the user must supply suitable task decompositions and state abstractions to the learner. How to discover these hierarchies automatically remains a challenging open problem. As a first step towards solving this problem, we introduce a general method for determining the validity of potential state abstractions that might form the basis of reusable tasks. Weions that might form the basis of reusable tasks. We build a probabilistic model of the underlying Markov decision problem and then statistically test the applicability of the state abstraction. We demonstrate the ability of our procedure to discriminate among safe and unsafe state abstractions in the familiar Taxi domain.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Identify Irrelevant State Variables

When they are available, safe state abstractions improve the efficiency of reinforcement learning algorithms by allowing an agent to ignore irrelevant distinctions between states while still learning an optimal policy. Prior work investigated how to incorporate state abstractions into existing algorithms, but most approaches required the user to provide the abstraction. How to discover this kin...

متن کامل

First-Order Bayes-Ball

Efficient probabilistic inference is key to the success of statistical relational learning. One issue that increases the cost of inference is the presence of irrelevant random variables. The Bayes-ball algorithm can identify the requisite variables in a propositional Bayesian network and thus ignore irrelevant variables. This paper presents a lifted version of Bayes-ball, which works directly o...

متن کامل

Hearing Where the Eyes See: Children Use an Irrelevant Visual Cue When Localizing Sounds.

To reduce sensory uncertainty, humans combine cues from multiple senses. However, in everyday life, many co-occurring cues are irrelevant to the task at hand. How do humans know which cues to ignore? And does this ability change with development? This study shows the ability to ignore cross-modal irrelevant information develops late in childhood. Participants performed a sound discrimination ta...

متن کامل

Automatic and Instructed Attention in Learned Predictiveness

In novel situations, learning is biased towards information that has a degree of prior predictive utility. In human learning, this is termed the learned predictiveness effect and has proved critical in theorising about the role of attention in learning. Two experiments are reported in which the relative contribution of controlled and automatic processes to learned predictiveness are investigate...

متن کامل

Ignoring irrelevant stimuli in latent inhibition and Stroop paradigms: the effects of schizotypy and gender.

Latent inhibition (LI), poor evidence of learning following preexposure to a task-irrelevant stimulus, reflects the ability to ignore inconsequential events. Stroop interference represents a failure to inhibit processing of a task-irrelevant word when it is incongruent with the required naming of the word's print color. The apparent commonality between the two effects is in contradiction to the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004